1 GAATTCCGGC CTGCTGCCGG GCCGCCCGAC CCGCCGGGCC ACACGGCAGA
51 GCCGCCTGAA GCCCAGCGCT GAGGCTGCAC TTTTCCGAGG GCTTGACATC
101 AGGGTCTATG TTTAAGTCTT AGCTCTTGCT TACAAAGACC ACGGCAATTC
151 CTTCTCTGAA GCCCTCGCAG CCCCACAGCG CCCTCGCAGC CCCAGCCTGC
201 CGCCTACTGC CCAGCAATGC CCTCCAGCGG CCCCGGGGAC ACCAGCAGCT
251 CCTCTCTGGA GCGGGAGGAT GATCGAAAGG AAGGAGAGGA ACAGGAGGAG
301 AACCGTGGCA AGGAAGAGCG CCAGGAGCCC AGCGCCACGG CCCGGAAGGT
351 GGGGAGGCCT GGCCGGAAGC GCAAGCACCC ACCGGTGGAA AGCAGTGACA
401 CCCCCAAGGA CCCAGCAGTG ACCACCAAGT CTCAGCCCAT GGCCCAGGAC
451 TCTGGCCCCT CAGATCTGCT ACCCAATGGA GACTTGGAGA AGCGGAGTGA
501 ACCCCAACCT GAGGAGGGGA GCCCAGCTGC AGGGCAGAAG GGTGGGGCCC
551 CAGCTGAAGG AGAGGGAACT GAGACCCCAC CAGAAGCCTC CAGAGCTGTG
601 GAGAATGGCT GCTGTGTGAC CAAGGAAGGC CGTGGAGCCT CTGCAGGAGA
651 GGGCAAAGAA CAGAAGCAGA CCAACATCGA ATCCATGAAA ATGGAGGGCT
701 CCCGGGGCCG ACTGCGAGGT GGCTTGGGCT GGGAGTCCAG CCTCCGTCAG
751 CGACCCATGC CAAGACTCAC CTTCCAGGCA GGGGAGGCCT ACTACATCAG
801 CAAACGGAAA CGGGATGAGT GGCTGGCACG TTGGAAAAGG GAGGCTGAGA
851 AGAAAGCCAA GGTAATTGCA GTAATGAATG CTGTGGAAGA GAACCAGGCC
901 TCTGGAGAGT CTCAGAAGGT GGAGGAGGCC AGCCCTCCTG CTGTGCAGCA
951 GCCCACGGAC CCTGCTTCTC CGACTGTGGC CACCACCCCT GAGCCAGTAG
1001 GAGGGGATGC TGGGGACAAG AATGCTACCA AAGCAGCCGA CGATGAGCCT
1051 GAGTATGAGG ATGGCCGGGG CTTTGGCATT GGAGAGCTGG TGTGGGGGAA
1101 ACTTCGGGGC TTCTCCTGGT GGCCAGGCCG AATTGTGTCT TGGTGGATGA

FIG. 1A-1
1151 CAGGCCGGAG CCGAGCAGCT GAAGGCACTC GCTGGGTCAT GTGGTTCGGA
1201 GATGGCAAGT TCTCAGTGGT GTGTGTGGAG AAGCTCATGC CGCTGAGCTC
1251 CTTCTGCAGT GCATTCCACC AGGCCACCTA CAACAAGCAG CCCATGTACC
1301 GCAAAGCCAT CTACGAAGTC CTCCAGGTGG CCAGCAGCCG TGCCGGGAAG
1351 CTGTTTCCAG CTTGCCATGA CAGTGATGAA AGTGACAGTG GCAAGGCTGT
1401 GGAAGTGCAG AACAAGCAGA TGATTGAATG GGCCCTCGGT GGCTTCCAGC
1451 CCTCGGGTCC TAAGGGCCTG GAGCCACGAG AAGAAGAGAA GAATCCTTAC
1501 AAGGAAGTTT ACACCGACAT GTGGGTGGAG CCTGAAGCAG CTGCTTACGC
1551 CCCACCCCCA CCAGCCAAGA AACCCAGAAA GAGCACAACA GAGAAACCTA
1601 AGGTCAAGGA GATCATTGAT GAGCGCACAA GGGAGCGGCT GGTGTATGAG
1651 GTGCGCCAGA AGTGCAGAAA CATCGAGGAC ATTTGTATCT CATGTGGGAG
1701 CCTCAATGTC ACCCTGGAGC ACCCACTCTT CATTGGAGGC ATGTGCCAGA
1751 ACTGTAAGAA CTGCTTCTTG GAGTGTGCTT ACCAGTATGA CGACGATGGG
1801 TACCAGTCCT ATTGCACCAT CTGCTGTGGG GGGCGTGAAG TGCTCATGTG
1851 TGGGAACAAC AACTGCTGCA GGTGCTTTTG TGTCGAGTGT GTGGATCTCT
1901 TGGTGGGGCC AGGAGCTGCT CAGGCAGCCA TTAAGGAAGA CCCCTGGAAC
1951 TGCTACATGT GCGGGCATAA GGGCACCTAT GGGCTGCTGC GAAGACGGGA
2001 AGACTGGCCT TCTCGACTCC AGATGTTCTT TGCCAATAAC CATGACCAGG
2051 AATTTGACCC CCCAAAGGTT TACCCACCTG TGCCAGCTGA GAAGAGGAAG
2101 CCCATCCGCG TGCTGTCTCT CTTTGATGGG ATTGCTACAG GGCTCCTGGT
2151 GCTGAAGGAC CTGGGCATCC AAGTGGACCG CTACATTGCC TCCGAGGTGT
2201 GTGAGGACTC CATCACGGTG GGCATGGTGC GGCACCAGGG AAAGATCATG
2251 TACGTCGGGG ACGTCCGCAG CGTCACACAG AAGCATATCC AGGAGTGGGG
2301 CCCATTCGAC CTGGTGATTG GAGGCAGTCC CTGCAATGAC CTCTCCATTG

FIG. 1A-2
2351 TCAACCCTGC CCGCAAGGGA CTTTATGAGG GTACTGGCCG CCTCTTCTTT
2401 GAGTTCTACC GCCTCCTGCA TGATGCGCGG CCCAAGGAGG GAGATGATCG
2451 CCCCTTCTTC TGGCTCTTTG AGAATGTGGT GGCCATGGGC GTTAGTGACA
2501 [illegible] CTCGCGATTT CTTGAGTCTA ACCCCGTGAT GATTGACGCC
2551 [illegible] CTGCTGCACA CAGGGCCCGT TACTTCTGGG GTAACCTTCC
2601 TGGCATGAAC AGGCCTTTGG CATCCACTGT GAATGATAAG CTGGAGCTGC
2651 AAGAGTGTCT GGAGCACGGC AGAATAGCCA AGTTCAGCAA AGTGAGGACC
2701 ATTACCACCA GGTCAAACTC TATAAAGCAG GGCAAAGACC AGCATTTCCC
2751 CGTCTTCATG AACGAGAAGG AGGACATCCT GTGGTGCACT GAAATGGAAA
2801 GGGTGTTTGG CTTCCCCGTC CACTACACAG ACGTCTCCAA CATGAGCCGC
2851 TTGGCGAGGC AGAGACTGCT GGGCCGATCG TGGAGCGTGC CGGTCATCCG
2901 CCACCTCTTC GCTCCGCTGA AGGAATATTT TGCTTGTGTG TAAGGGACAT
2951 GGGGGCAAAC TGAAGTAGTG ATGATAAAAA AGTTAAACAA ACAAACAAAC
3001 AAAAAACAAA ACAAAACAAT AAAACACCAA GAACGAGAGG ACGGAGAAAA
3051 GTTCAGCACC CAGAAGAGAA AAAGGAATTT AAAGCAAACC ACAGAGGAGG
3101 AAAACGCCGG AGGGCTTGGC CTTGCAAAAG GGTTGGACAT CATCTCCTGA
3151 GTTTTCAATG TTAACCTTCA GTCCTATCTA AAAAGCAAAA TAGGCCCCTC
3201 CCCTTCTTCC CCTCCGGTCC TAGGAGGCGA ACTTTTTGTT TTCTACTCTT
3251 TTTCAGAGGG GTTTTCTGTT TGTTTGGGTT TTTGTTTCTT GCTGTGACTG
3301 AAACAAGAGA GTTATTGCAG CAAAATCAGT AACAACAAAA AGTAGAAATG
3351 CCTTGGAGAG GAAAGGGAGA GAGGGAAAAT TCTATAAAAA CTTAAAATAT
3401 TGGTTTTTTT TTTTTTTCCT TTTCTATATA TCTCTTTGGT TGTCTCTAGC
3451 CTGATCAGAT AGGAGCACAA ACAGGAAGAG AATAGAGACC CTCGGAGGCA
3501 GAGTCTCCTC TCCCACCCCC CGAGCAGTCT CAACAGCACC ATTCCTGGTC



Sheet 4 of 38

and Uses Thereof



3551 mZmkZk GAACCCAACT AGCAGCAGGG CGCTGAGAGA ACACCACACC 

3601 kGkCkCim TACAGTATTT CAGGTGCCTA CCACACAGGA AACCTTGAAG 

3651 AAAACCAGTT TCTAGAAGCC GCTGTTACCT CTTGTTTACA GTTTATATAT 

3701 ATATGATAGA TATGAGATAT ATATATATAA AAGGTACTGT TAACTACTGT 

3751 ACATCCCGAC TTCATAATGG TGCTTTCAAA ACAGCGAGAT GAGCAAAGAC 

3801 ATCAGCTTCC GCCTGGCCCT CTGTGCAAAG GGTTTCAGCC CAGGATGGGG 

3851 AGAGGGGAGC AGCTGGAGGG GGTTTTAACA AACTGAAGGA TGACCCATAT 

3901 CACCCCCCAC CCCTGCCCCA TGCCTAGCTT CACCTGCCAA AAAGGGGCTC 

3951 AGCTGAGGTG GTCGGACCCT GGGGAAGCTG AGTGTGGAAT TTATCCAGAC 

4001 TCGCGTGCAA TAACCTTAGA ATATGAATCT AAAATGACTG CCTCAGAAAA 

4051 ATGGCTTGAG AAAACATTGT CCCTGATTTT GAATTCGTCA GCCACGTTGA 

4101 AGGCCCCTTG TGGGATCAGA AATATTCCAG AGTGAGGGAA AGTGACCCGC 

4151 CATTAACCCC NCCTGGAGCA AATAAAAAAA CATACAAAAT GT 



FIG.1A-4 
Mouse Dnmt3b1 DNA Sequence

1-751 [sequence illegible in source]
801 CCAGCGTCGA CTTCATGGAA GAAGTGACAC CTAAGAGCGT CAGTACCCCA
851 TCAGTTGACT TGAGCCAGGA TGGAGATCAG GAGGGTATGG ATACCACACA
901 GGTGGATGCA GAGAGCAGAG ATGGAGACAG CACAGAGTAT CAGGATGATA
951 AAGAGTTTGG AATAGGTGAC CTCGTGTGGG GAAAGATCAA GGGCTTCTCC
1001 TGGTGGCCTG CCATGGTGGT GTCCTGGAAA GCCACCTCCA AGCGACAGGC

FIG.1B-1
1051 CATGCCCGGA ATGCGCTGGG TACAGTGGTT TGGTGATGGC AAGTTTTCTG
1101 AGATCTCTGC TGACAAACTG GTGGCTCTGG GGCTGTTCAG CCAGCACTTT
1151 AATCTGGCTA CCTTCAATAA GCTGGTTTCT TATAGGAAGG CCATGTACCA
1201 CACTCTGGAG AAAGCCAGGG TTCGAGCTGG CAAGACCTTC TCCAGCAGTC
1251 CTGGAGAGTC ACTGGAGGAC CAGCTGAAGC CCATGCTGGA GTGGGCCCAC
1301 GGTGGCTTCA AGCCTACTGG GATCGAGGGC CTCAAACCCA ACAAGAAGCA
1351 ACCAGTGGTT AATAAGTCGA AGGTGCGTCG TTCAGACAGT AGGAACTTAG
1401 AACCCAGGAG ACGCGAGAAC AAAAGTCGAA GACGCACAAC CAATGACTCT
1451 GCTGCTTCTG AGTCCCCCCC ACCCAAGCGC CTCAAGACAA ATAGCTATGG
1501 CGGGAAGGAC CGAGGGGAGG ATGAGGAGAG CCGAGAACGG ATGGCTTCTG
1551 AAGTCACCAA CAACAAGGGC AATCTGGAAG ACCGCTGTTT GTCCTGTGGA
1601 AAGAAGAACC CTGTGTCCTT CCACCCCCTC TTTGAGGGTG GGCTCTGTCA
1651 GAGTTGCCGG GATCGCTTCC TAGAGCTCTT CTACATGTAT GATGAGGACG
1701 GCTATCAGTC CTACTGCACC GTGTGCTGTG AGGGCCGTGA ACTGCTGCTG
1751 TGCAGTAACA CAAGCTGCTG CAGATGCTTC TGTGTGGAGT GTCTGGAGGT
1801 GCTGGTGGGC GCAGGCACAG CTGAGGATGC CAAGCTGCAG GAACCCTGGA
1851 GCTGCTATAT GTGCCTCCCT CAGCGCTGCC ATGGGGTCCT CCGACGCAGG
1901 AAAGATTGGA ACATGCGCCT GCAAGACTTC TTCACTACTG ATCCTGACCT
1951 GGAAGAATTT GAGCCACCCA AGTTGTACCC AGCAATTCCT GCAGCCAAAA
2001 GGAGGCCCAT TAGAGTCCTG TCTCTGTTTG ATGGAATTGC AACGGGGTAC
2051 TTGGTGCTCA AGGAGTTGGG TATTAAAGTG GAAAAGTACA TTGCCTCCGA
2101 AGTCTGTGCA GAGTCCATCG CTGTGGGAAC TGTTAAGCAT GAAGGCCAGA
2151 TCAAATATGT CAATGACGTC CGGAAAATCA CCAAGAAAAA TATTGAAGAG
2201 TGGGGCCCGT TCGACTTGGT GATTGGTGGA AGCCCATGCA ATGATCTCTC

FIG.1B-2
2251 TAACGTCAAT CCTGCCCGCA AAGGTTTATA TGAGGGCACA GGAAGGCTCT
2301 TCTTCGAGTT TTACCACTTG CTGAATTATA CCCGCCCCAA GGAGGGCGAC
2351 AACCGTCCAT TCTTCTGGAT GTTCGAGAAT GTTGTGGCCA TGAAAGTGAA
2401 TGACAAGAAA GACATCTCAA GATTCCTGGC ATGTAACCCA GTGATGATCG
2451 ATGCCATCAA GGTGTCTGCT GCTCACAGGG CCCGGTACTT CTGGGGTAAC
2501 CTACCCGGAA TGAACAGGCC CGTGATGGCT TCAAAGAATG ATAAGCTCGA
2551 GCTGCAGGAC TGCCTGGAGT TCAGTAGGAC AGCAAAGTTA AAGAAAGTGC
2601 AGACAATAAC CACCAAGTCG AACTCCATCA GACAGGGCAA AAACCAGCTT
2651 TTCCCTGTAG TCATGAATGG CAAGGACGAC GTTTTGTGGT GCACTGAGCT
2701 CGAAAGGATC TTCGGCTTCC CTGCTCACTA CACGGACGTG TCCAACATGG
2751 GCCGCGGCGC CCGTCAGAAG CTGCTGGGCA GGTCCTGGAG TGTACCGGTC
2801 ATCAGACACC TGTTTGCCCC CTTGAAGGAC TACTTTGCCT GTGAATAGTT
2851 CTACCCAGGA CTGGGGAGCT CTCGGTCAGA GCCAGTGCCC AGAGTCACCC
2901 CTCCCTGAAG GCACCTCACC TGTCCCCTTT TTAGCTCACC TGTGTGGGGC
2951 CTCACATCAC TGTACCTCAG CTTTCTCCTG CTCAGTGGGA GCAGAGCCTC
3001 CTGGCCCTTG CAGGGGAGCC CCGGTGCTCC CTCCGTGTGC ACAGCTCAGA
3051 CCTGGCTGCT TAGAGTAGCC CGGCATGGTG CTCATGTTCT CTTACCCTGA
3101 AACTTTAAAA CTTGAAGTAG GTAGTAAGAT GGCTTTCTTT TACCCTCCTG
3151 AGTTTATCAC TCAGAAGTGA TGGCTAAGAT ACCAAAAAAA CAAACAAAAA
3201 CAGAAACAAA AAACAAAAAA AAACCTCAAC AGCTCTCTTA GTACTCAGGT
3251 TCATGCTGCA AAATCACTTG AGATTTTGTT TTTAAGTAAC CCGTGCTCCA
3301 CATTTGCTGG AGGATGCTAT TGTGAATGTG GGCTCAGATG AGCAAGGTCA
3351 AGGGGCCAAA AAAAATTCCC CCTCTCCCCC CAGGAGTATT TGAAGATGAT
3401 GTTTATGGTT TAAGTCTTCC TGGCACCTTC CCCTTGCTTT GGTACAAGGG

FIG.1B-3
3451 CTGAAGTCCT GTTGGTCTTG TAGCATTTCC CAGGATGATG ATGTCAGCAG

3501 GGATGACATC ACCACCTTTA GGGCTTTTCC CTGGCAGGGG CCCATGTGGC 

3551 TAGTCCTCAC GAAGACTGGA GTAGAATGTT TGGAGCTCAG GAAGGGTGGG 

3601 TGGAGTGGCC CTCTTCCAGG TGTGAGGGAT ACGAAGGAGG AAGCTTAGGG 

3651 AAATCCATTC CCCACTCCCT CTTGCCAAAT GAGGGGCCCA GTCCCCAACA 

3701 GCTCAGGTCC CCAGAACCCC CTAGTTCCTC ATGAGAAGCT AGGACCAGAA 

3751 GCACATCGTT CCCCTTATCT GAGCAGTGTT TGGGGAACTA CAGTGAAAAC 

3801 CTTCTGGAGA TGTTAAAAGC TTTTTACCCC ACGATAGATT GTGTTTTTAA 

3851 GGGGTGCTTT TTTTAGGGGC ATCACTGGAG ATAAGAAAGC TGCATTTCAG 

3901 AAATGCCATC GTAATGGTTT TTAAACACCT TTTACCTAAT TACAGGTGCT 

3951 ATTTTATAGA AGCAGACAAC ACTTCTTTTT ATGACTCTCA GACTTCTATT 

4001 TTCATGTTAC CATTTTTTTT GTAACTCGCA AGGTGTGGGC TTTTGTAACT 

4051 TCACAGGTGT GGGGAGAGAC TGCCTTGTTT CAACAGTTTG TCTCCACTGG 

4101 TTTCTAATTT TTAGGTGCAA AGATGACAGA TGCCCAGAGT TTACCTTTCT 

4151 GGTTGATTAA AGTTGTATTT CTCTAAAAAA AAAAAAAAAA AAAAA 



FIG.1B-4 
Human DNMT3A DNA Sequence

1 GCCGCGG CACCAGGGCG CGCAGCCGGG
28 CCGGCCCGAC CCCACCGGCC ATACGGTGGA GCCATCGAAG CCCCCACCCA
78 CAGGCTGACA GAGGCACCGT TCACCAGAGG GCTCAACACC GGGATCTATG
128 TTTAAGTTTT AACTCTCGCC TCCAAAGACC ACGATAATTC CTTCCCCAAA
178 GCCCAGCAGC CCCCCAGCCC CGCGCAGCCC CAGCCTGCCT CCCGGCGCCC
228 AGATGCCCGC CATGCCCTCC AGCGGCCCCG GGGACACCAG CAGCTCTGCT
278 GCGGAGCGGG AGGAGGACCG AAAGGACGGA GAGGAGCAGG AGGAGCCGCG
328 TGGCAAGGAG GAGCGCCAAG AGCCCAGCAC CACGGCACGG AAGGTGGGGC
378 GGCCTGGGAG GAAGCGCAAG CACCCCCCGG TGGAAAGCGG TGACACGCCA
428 AAGGACCCTG CGGTGATCTC CAAGTCCCCA TCCATGGCCC AGGACTCAGG
478 CGCCTCAGAG CTATTACCCA ATGGGGACTT GGAGAAGCGG AGTGAGCCCC
528 AGCCAGAGGA GGGGAGCCCT GCTGGGGGGC AGAAGGGCGG GGCCCCAGCA
578 GAGGGAGAGG GTGCAGCTGA GACCCTGCCT GAAGCCTCAA GAGCAGTGGA
628 AAATGGCTGC TGCACCCCCA AGGAGGGCCG AGGAGCCCCT GGAGAAGCGG
678 GCAAAGAACA GAAGGAGACC AACATCGAAT CCATGAAAAT GGAGGGCTCC
728 CGGGGCCGGC TGCGGGGTGG CTTGGGCTGG GAGTCCAGCC TCCGTCAGCG
778 GCCCATGCCG AGGCTCACCT TCCAGGCGGG GGACCCCTAC TACATCAGCA
828 AGCGCAAGCG GGACGAGTGG CTGGCACGCT GGAAAAGGGA GGCTGAGAAG
878 AAAGCCAAGG TCAGTGCAGG AATGAATGCT GTGGAAGAAA ACCAGGGGCC
928 CGGGGAGTCT CAGAAGGTGG AGGAGGCCAG CCCTCCTGCT GTGCAGCAGC
978 CCACTGACCC CGCATCCCCC ACTGTGGCTA CCACGCCTGA GCCCGTGGGG
1028 TCCGATGCTG GGGACAAGAA TGCCACCAAA GCAGGCGATG ACGAGCCAGA
1078 GTACGAGGAC GGCCGGGGCT TTGGCATTGG GGAGCTGGTG TGGGGGAAAC
1128 TGCGGGGCTT CTCCTGGTGG CCAGGCCGCA TTGTGTCTTG GTGGATGACG
1178 GGCCGGAGCC GAGCAGCTGA AGGCACCCGC TGGGTCATGT GGTTCGGAGA
1228 CGGCAAATTC TCAGTGGTGT GTGTTGAGAA GCTGATGCCG CTGAGCTCGT
1278 TTTGCAGTGC GTTCCACCAG GCCACGTACA ACAAGCAGCC CATGTACCGC
1328 AAAGCCATCT ACGAGGTCCT GCAGGTGGCC AGCAGCCGCG CGGGGAAGCT
1378 GTTCCCGGTG TGCCACGACA GCGATGAGAG TGACACTGCC AAGGCCGTCG
1428 AGGTGCAGAA CAAGCCCATG ATTGAATGGG CCCTGGGGGG CTTCCAGCCT
1478 TCTGGCCCTA AGGGCCTGGA GCCACCAGAA GAAGAGAAGA ATCCCTACAA
1528 AGAAGTGTAC ACGGACATGT GGGTGGAACC TGAGGCAGCT GCCTACGCAC
1578 CACCTCCACC AGCCAAAAAG CCCCGGAAGA GCACAGCGGA GAAGCCCAAG
1628 GTCAAGGAGA TTATTGATGA GCGCACAAGA GAGCGGCTGG TGTACGAGGT
1678 GCGGCAGAAG TGCCGGAACA TTGAGGACAT CTGCATCTCC TGTGGGAGCC
1728 TCAATGTTAC CCTGGAACAC CCCCTCTTCG TTGGAGGAAT GTGCCAAAAC
1778 TGCAAGAACT GCTTTCTGGA GTGTGCGTAC CAGTACGACG ACGACGGCTA
1828 CCAGTCCTAC TGCACCATCT GCTGTGGGGG CCGTGAGGTG CTCATGTGCG
1878 GAAACAACAA CTGCTGCAGG TGCTTTTGCG TGGAGTGTGT GGACCTCTTG
1928 GTGGGGCCGG GGGCTGCCCA GGCAGCCATT AAGGAAGACC CCTGGAACTG
1978 CTACATGTGC GGGCACAAGG GTACCTACGG GCTGCTGCGG CGGCGAGAGG
2028 ACTGGCCCTC CCGGCTCCAG ATGTTCTTCG CTAATAACCA [illegible]
2078 TTTGACCCTC CAAAGGTTTA CCCACCTGTC CCAGCTGAGA AGAGGAAGCC
2128 CATCCGGGTG CTGTCTCTCT TTGATGGAAT CGCTACAGGG CTCCTGGTGG
2178 TGAAGGACTT GGGCATTCAG GTGGACCGCT ACATTGCCTC GGAGGTGTGT
2228 GAGGACTCCA TCACGGTGGG CATGGTGCGG CACCAGGGGA AGATCATGTA

2278 CGTCGGGGAC GTCCGCAGCG TCACACAGAA GCATATCCAG GAGTGGGGCC 

2328 CATTCGATCT GGTGATTGGG GGCAGTCCCT GCAATGACCT CTCCATCGTC 

2378 AACCCTGCTC GCAAGGGCCT CTACGAGGGC ACTGGCCGGC TCTTCTTTGA 

2428 GTTCTACCGC CTCCTGCATG ATGCGCGGCC CAAGGAGGGA GATGATCGCC 

2478 CCTTCTTCTG GCTCTTTGAG AATGTGGTGG CCATGGGCGT TAGTGACAAG 

2528 AGGGACATCT CGCGATTTCT CGAGTCCAAC CCTGTGATGA TTGATGCCAA 

2578 AGAAGTGTCA GCTGCACACA GGGCCCGCTA CTTCTGGGGT AACCTTCCCG 

2628 GTATGAACAG GCCGTTGGCA TCCACTGTGA ATGATAAGCT GGAGCTGCAG 

2678 GAGTGTCTGG AGCATGGCAG GATAGCCAAG TTCAGCAAAG TGAGGACCAT 

2728 TACTACGAGG TCAAACTCCA TAAAGCAGGG CAAAGACCAG CATTTTCCTG 

2778 TCTTCATGAA TGAGAAAGAG GACATCTTAT GGTGCACTGA AATGGAAAGG 

2828 GTATTTGGTT TCCCAGTCCA CTATACTGAC GTCTCCAACA TGAGCCGCTT 

2878 GGCGAGGCAG AGACTGCTGG GCCGGTCATG GAGCGTGCCA GTCATCCGCC 

2928 ACCTCTTCGC TCCGCTGAAG GAGTATTTTG CGTGTGTGTA AGGGACATGG 

2978 GGGCAAACTG AGGTAGCGAC ACAAAGTTAA ACAAACAAAC AAAAAACACA 

3028 AAACATAATA AAACACCAAG AACATGAGGA TGGAGAGAAG TATCAGCACC 

3078 CAGAAGAGAA AAAGGAATTT AAAACAAAAA CCACAGAGGC GGAAATACCG 

3128 GAGGGCTTTG CCTTGCGAAA AGGGTTGGAC ATCATCTCCT GATTTTTCAA 

3178 TGTTATTCTT CAGTCCTATT TAAAAACAAA ACCAAGCTCC CTTCCCTTCC 

3228 TCCCCCTTCC CTTTTTTTTC GGTCAGACCT TTTATTTTCT ACTCTTTTCA 

3278 GAGGGGTTTT CTGTTTGTTT GGGTTTTGTT TCTTGCTGTG ACTGAAACAA 

3328 GAAGGTTATT GCAGCAAAAA TCAGTAACAA AAAATAGTAA CAATACCTTG 

3378 CAGAGGAAAG GTGGGAGGAG AGGAAAAAAG GGAAATTTTT AAAGAAATCT 
3428 ATATATTGGG TTGTTTTTTT TTTTGTTTTT TGTTTTTTTT TTTTGGGTTT
3478 TTTTTTTTTA CTATATATCT TTTTTTTGTT GTCTCTAGCC TGATCAGATA
3528 GGAGCACAAG CAGGGGACGG AAAGAGAGAG ACACTCAGGC GGCAGCATTC
3578 CCTCCCAGCC ACTGAGCTGT CGTGCCAGCA CCATTCCTGG TCACGCAAAA
3628 PAGAArrrAG TTAGCAGCAG GGAGACGAGA ACACCACACA AGACATTTTT
3678 CTACAGTATT TCAGGTGCCT ACCACACAGG AAACCTTGAA GAAAATCAGT
3728 TTPTAGAAGG CGCTGTTACC TCTTGTTTAC AGTTTATATA TATATGATAG
3778 ATATGAGATA TATATATAAA AGGTACTGTT AACTACTGTA CAACCCGACT
3828 TGATAATGGT GCTTTCAAAC AGCGAGATGA GTAAAAACAT CAGCTTCCAC
3878 GTTGrrTTfT GGGGAAAGGG TTTCACCAAG GATGGAGAAA GGGAGACAGC
3928 TTGCAGATGG CGCGTTCTCA CGGTGGGCTC TTCCCCTTGG TTTGTAACGA
3978 AGTGAAGGAG GAGAACTTGG GAGCCAGGTT CTCCCTGCCA AAAAGGGGGC
4028 TAGATGAGGT GGTCGGGCCC GTGGACAGCT GAGAGTGGGA TTCATCCAGA
4078 CTCATGCAAT AACCCTTTGA TTGTTTTCTA AAAGGAGACT CCCTCGGCAA
4128 GATGGCAGAG GGTACGGAGT CTTCAGGCCC AGTTTCTCAC TTTAGCCAAT
4178 TCGAGGGCTC CTTGTGGTGG GATCAGAACT AATCCAGAGT GTGGGAAAGT
4228 GACAGTCAAA ACCCCACCTG GAGCAAATAA AAAAACATAC AAAACGTAAA
4278 AAAAAAAAAA AAAAAA
Human DNMT3B1 DNA Sequence:

1 GGCCGCGAAT TCGGCACGAG CCCTGCACGG CCGCCAGCCG GCCTCCCGCC
51 AGCCAGCCCC GACCCGCGGC TCCGCCGCCC AGCCGCGCCC CAGCCAGCCC
101 TGCGGCAGGA AAGCATGAAG GGAGACACCA GGCATCTCAA TGGAGAGGAG
151 GACGCCGGCG GGAGGGAAGA CTCGATCCTC GTCAACGGGG CCTGCAGCGA
201 CCAGTCCTCC GACTCGCCCC CAATCCTGGA GGCTATCCGC ACCCCGGAGA
251 TCAGAGGCCG AAGATCAAGC TCGCGACTCT CCAAGAGGGA GGTGTCCAGT
301 CTGCTAAGCT ACACACAGGA CTTGACAGGC GATGGCGACG GGGAAGATGG
351 GGATGGCTCT GACACCCCAG TCATGCCAAA GCTCTTCCGG GAAACCAGGA
401 CTCGTTCAGA AAGCCCAGCT GTCCGAACTC GAAATAACAA CAGTGTCTCC
451 AGCCGGGAGA GGCACAGGCC TTCCCCACGT TCCACCCGAG GCCGGCAGGG
501 CCGCAACCAT GTGGACGAGT CCCCCGTGGA GTTCCCGGCT ACCAGGTCCC
551 TGAGACGGCG GGCAACAGCA TCGGCAGGAA CGCCATGGCC GTCCCCTCCC
601 AGCTCTTACC TTACCATCGA CCTCACAGAC GACACAGAGG ACACACATGG
651 GACGCCCCAG AGCAGCAGTA CCCCCTACGC CCGCCTAGCC CAGGACAGCC
701 AGCAGGGGGG CATGGAGTCC CCGCAGGTGG AGGCAGACAG TGGAGATGGA
751 GACAGTTCAG AGTATCAGGA TGGGAAGGAG TTTGGAATAG GGGACCTCGT
801 GTGGGGAAAG ATCAAGGGCT TCTCCTGGTG GCCCGCCATG GTGGTGTCTT
851 GGAAGGCCAC CTCCAAGCGA CAGGCTATGT CTGGCATGCG GTGGGTCCAG
901 TGGTTTGGCG ATGGCAAGTT CTCCGAGGTC TCTGCAGACA AACTGGTGGC
951 ACTGGGGCTG TTCAGCCAGC ACTTTAATTT GGCCACCTTC AATAAGCTCG
1001 TCTCCTATCG AAAAGCCATG TACCATGCTC TGGAGAAAGC TAGGGTGCGA
1051 GCTGGCAAGA CCTTCCCCAG CAGCCCTGGA GACTCATTGG AGGACCAGCT
1101 GAAGCCCATG TTGGAGTGGG CCCACGGGGG CTTCAAGCCC ACTGGGATCG
1151 AGGGCCTCAA ACCCAACAAC ACGCAACCAG TGGTTAATAA GTCGAAGGTG
1201 CGTCGTGCAG GCAGTAGGAA ATTAGAATCA AGGAAATACG AGAACAAGAC
1251 TCGAAGACGC ACAGCTGACG ACTCAGCCAC CTCTGACTAC TGCCCCGCAC
1301 CCAAGCGCCT CAAGACAAAT TGCTATAACA ACGGCAAAGA CCGAGGGGAT
1351 GAAGATCAGA GCGGAGAACA AATGGCTTCA GATGTTGCCA ACAACAAGAG
1401 CAGCCTGGAA GATGGCTGTT TGTCTTGTGG CAGGAAAAAC CCCGTGTCCT
1451 TCCACCCTCT CTTTGAGGGG GGGCTCTGTC AGACATGCCG GGATCGCTTC
1501 CTTGAGCTGT TTTACATGTA TGATGACGAT GGCTATCAGT CTTACTGCAC
1551 TGTGTGCTGC GAGGGCCGAG AGCTGCTGCT TTGCAGCAAC ACGAGCTGCT
1601 GCCGGTGTTT CTGTGTGGAG TGCCTGGAGG TGCTGGTGGG CACAGGCACA
1651 GCGGCCGAGG CCAAGCTTCA GGAGCCCTGG AGCTGCTACA TGTGTCTCCC
1701 GCAGCGCTGT CATGGCGTCC TGCGGCGCCG GAAGGACTGG AACGTGCGCC
1751 TGCAGGCCTT CTTCACCAGT GACACGGGGC TTGAATACGA AGCCCCCAAG
1801 CTGTACCCTG CCATTCCCGC AGCCCGAAGG CGGCCCATTC GAGTCCTGTC
1851 ATTGTTTGAT GGCATCGCGA CAGGCTACCT AGTCCTCAAA GAGTTGGGCA
1901 TAAAGGTAGG AAAGTACGTC GCTTCTGAAG TGTGTGAGGA GTCCATTGCT
1951 GTTGGAACCG TGAAGCACGA GGGGAATATC AAATACGTGA ACGACGTGAG
2001 GAAGATCAGA AAGAAAAATA TTGAAGAATG GGGCCCATTT GACTTGGTGA
2051 TTGGCGGAAG CCCATGCAAC GATCTCTCAA ATGTGAATCC AGCCAGGAAA
2101 GGCCTGTATG AGGGTACAGG CCGGCTCTTC TTCGAATTTT ACGAGCTGCT
2151 GAATTACTGA CGCCCCAAGG AGGGTGATGA CCGGCCGTTC TTCTGGATGT
2201 TTGAGAATGT TGTAGCCATG AAGGTTGGCG ACAAGAGGGA CATCTCACGG
2251 TTGCTGGAGT GTAATCCAGT GATGATTGAT GCCATCAAAG TTTCTGCTGC
2301 TCACAGGGCC CGATACTTCT GGGGCAACCT ACCCGGGATG AACAGGCCCG
2351 TGATAGCATC AAAGAATGAT AAACTCGAGC TGCAGGACTG CTTGGAATAC
2401 AATAGGATAG CCAAGTTAAA GAAAGTACAG ACAATAACCA CCAAGTCGAA
2451 CTCGATCAAA CAGGGGAAAA ACCAACTTTT CCCTGTTGTC ATGAATGGCA
2501 AAGAAGATGT TTTGTGGTGC ACTGAGCTCG AAAGGATCTT TGGCTTTCCT
2551 GTGCACTACA CAGACGTGTC CAACATGGGC CGTGGTGCCC GCCAGAAGCT
2601 GCTGGGAAGG TCCTGGAGCG TGCCTGTCAT CCGACACCTC TTCGCCCCTC
2651 TGAAGGACTA CTTTGCATGT GAATAGTTCC AGCCAGGCCC CAAGCCCACT
2701 GGGGTGTGTG GCAGAGCCAG GACCCAGGAG GTGTGATTCC TGAAGGCATC
2751 CCCAGGCCCT GCTCTTCCTC AGCTGTGTGG GTCATACGGT GTACCTCAGT
2801 TCCCTCTTGC TCAGTGGGGG CAGAGCCACC TGACTCTTGC AGGGGTAGCC
2851 TGAGGTGCCG CCTCCTTGTG CACAAATCAG ACCTGGCTGC TTGGAGCAGC
2901 CTAACACGGT GCTCATTTTT TCTTCTCCTA AAACTTTAAA ACTTGAAGTA
2951 GGTAGCAACG TGGCTTTTTT TTTTTCCCTT CCTGGGTCTA CCACTCAGAG
3001 AAACAATGGC TAAGATACCA AAACCACAGT GCCGACAGCT CTCCAATACT
3051 CAGGTTAATG CTGAAAAATC ATCCAAGACA GTTATTGCAA GAGTTTAATT
3101 TTTGAAAACT GGGTACTGCT ATGTGTTTAC AGACGTGTGC AGTTGTAGGC
3151 ATGTAGCTAC AGGACATTTT TAAGGGCCCA GGATCGTTTT TTCCCAGGGC
3201 AAGCAGAAGA GAAAATGTTG TATATGTCTT TTACCCGGCA CATTCCCCTT
3251 GCCTAAATAC AAGGGCTGGA GTCTGCACGG GACCTATTAG AGTATTTTCC
3301 ACAATGATGA TGATTTCAGC AGGGATGACG TCATCATCAC ATTCAGGGCT
3351 ATTTTTTCCC CCACAAACCC AAGGGCAGGG GCCACTCTTA GCTAAATCCC
3401 TCCCCGTGAC TGCAATAGAA CCCTCTGGGG AGCTCAGGAA GGGGTGTGCT
3451 GAGTTCTATA ATATAAGCTG CCATATATTT TGTAGACAAG TATGGCTCCT
3501 CCATATCTCC CTCTTCCCTA GGAGAGGAGT GTGAAGCAAG GAGCTTAGAT
3551 AAGACACCCC CTCAAACCCA TTCCCTCTCC AGGAGACCTA CCCTCCAGAG
3601 GCACAGGTCC CCAGATGAGA AGTCTGCTAC CCTCATTTCT CATCTTTTTA
3651 CTAAACTCAG AGGCAGTGAC AGCAGTCAGG GACAGACATA CATTTCTCAT
3701 ACCTTCCCCA CATCTGAGAG ATGACAGGGA AAACTGCAAA GCTCGGTGCT

3751 CCCTTTGGAG ATTTTTTAAT CCTTTTTTAT TCCATAAGAA GTCGTTTTTA 

3801 GGGAGAACGG GAATTCAGAC AAGCTGCATT TCAGAAATGC TGTCATAATG 

3851 GTTTTTAACA CCTTTTACTC TTCTTACTGG TGCTATTTTG TAGAATAAGG 

3901 AACAACGTTG ACAAGTTTTG TGGGGCTTTT TATACACTTT TTAAAATCTC 

3951 AAACTTCTAT TTTTATGTTT AACGTTTTCA TTAAAATTTT TTTGTAACTG 

4001 GAGCCACGAC GTAACAAATA TGGGGAAAAA ACTGTGCCTT GTTTCAACAG 

4051 TTTTTGCTAA TTTTTAGGCT GAAAGATGAC GGATGCCTAG AGTTTACCTT 

4101 ATGTTTAATT AAAATCAGTA TTTGTCTAAA AAAAAAAAAA AAAAA 



FIG.1D-4




1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
Mouse Dnmt3a Protein



MPSSGPGDTS 


SSSLEREDDR 


KEGEEQEENR 


GKEERQEPSA 


TARKVGRPGR 


KRKHPPVESS 


DTPKDPAVTT 


KSQPMAQDSG 


PSDLLPNGDL 


EKRSEPQPEE 


GSPAAGQKGG 


APAEGEGTET 


PPEASRAVEN 


GCCVTKEGRG 


ASAGEGKEQK 


QTNIESMKME 


GSRGRLRGGL 


GWESSLRQRP 


MPRLTFQAGD 


PYYISKRKRD 


EWLARWKREA 


EKKAKVIAVM 


NAVEENQASG


ESQKVEEASP 


PAVQQPTDPA 


SPTVATTPEP 


VGGDAGDKNA 


TKAADDEPEY 


EDGRGFGIGE


LVWGKLRGFS 


WWPGRIVSWW


MTGRSRAAEG 


TRWVMWFGDG


KFSVVCVEKL


MPLSSFCSAF 